A Geometric Approach to Classification of Trash in Ginned Cotton
نویسندگان
چکیده
This paper discusses the use of geometric approach to classify different types of trash (non-lint, non-fiber material) in ginned cotton. Pieces of trash can have complicated shapes, so we would like to find a good approximating family of sets. Which approximating family is the best? We reduce the corresponding optimization problem to a geometric one: namely, we show that, under some reasonable conditions, an optimal family must be shift-, rotationand scale-invariant. We then use this geometric reduction to conclude that the best approximating low-dimensional families consist of sets with linear or circular boundaries. This result is in good agreement with the existing empirical classification of trash into bark1, bark2, leaf, and pepper trash. A practical problem: brief description. The main use of cotton is in textile industry; for that purpose, we only need cotton fiber called lint. Mechanical harvesters collect fiber together with the seeds. To separate lint from the seeds and from other non-lint material, a special process called ginning is used. Ginned cotton consists primarily of lint, but some non-lint material (trash) is left. For the further textile processing, it is important to know how much trash of what type is left. In principle, it is possible to detect the amount and type of trash by visual inspection, because trash is usually of different color than the whitish lint and is thus clearly visible. The problem with visual inspection is that the visual inspection of all 15 to 19 million bales of cotton annually produced in the USA is a very time-consuming and expensive process. It is therefore desirable to develop an automatic system for the analysis of trash in ginned cotton (see, e.g., [Lieberman et al. 1994], [Lieberman et al. 1997]). Since trash is clearly visible on the lint background, it is natural to take a photo of a cotton bale, and then run a computer program to analyze this photo. Our goal is to separate trash from lint; since trash is of different color than the lint, we can ignore the details about the intensities of different pixels and use a threshold on intensity to transform the original image into a black-and-white one: points in which the intensity is above the threshold are treated as white (i.e., as lint), and points in which the intensity is below the threshold are treated as black (i.e., as trash). As a result, we get a black-and-white picture in which several pieces of trash are present on the white background. Pieces of trash can have complicated shapes. The user needs a simple classification of these shapes. A natural way of classifying different shapes is to describe several simple approximate shapes, and then to classify a given piece of trash based on which simple shape it resembles most. So, to develop a good classification of trash in cotton, we need to find a good approximating family of sets. Because of the large volume of cotton processing, even a small gain in classification quality can lead to a large economic benefit. It is therefore desirable to look not simply for a good approximating family of sets, but rather for a family which is optimal in some reasonable sense. Of course, the more parameters we allow, the better the approximation. So, the question can be reformulated as follows: for a given number of parameters (i.e., for a given dimension of approximating family), which is the best family? In this paper, we use a geometric formalism developed in [Kreinovich et al. 1999] and [Wolff et al. 1999] to formalize and solve this problem. Formalizing the problem. In this formalization, we will, in effect, follow [Kreinovich et al. 1999] and [Wolff et al. 1999]. The pieces of trash are usually smooth lines or areas with smooth boundaries, so it is reasonable to restrict ourselves to families of sets with analytical boundaries. By definition, when we say that a piece of a boundary is analytical, we mean that it can be described by an equation F (x; y) = 0 for some analytical function F (x; y) = a+ b x+ c y + d x2 + e x y + f y2 + : : : So, in order to describe a family, we must describe the corresponding class of analytical functions F (x; y). Since we are interested in families of sets which are characterized by finitely many parameters (i.e., in finite-dimensional families of sets), it is natural to consider finite-dimensional families of functions, i.e., families of the type fC1 F1(x; y) + : : :+ Cd Fd(x; y)g; where Fi(z) are given analytical functions, and C1; : : : ; Cd are arbitrary (real) constants. So, the question becomes: which of such families is the best? When we say “the best”, we mean that on the set of all such families, there must be a relation describing which family is better or equal in quality. This relation must be transitive (if A is better than B, and B is better than C, then A is better than C). This relation is not necessarily asymmetric, because we can have two approximating families of the same quality. However, we would like to require that this relation be final in the sense that it should define a unique best family Aopt (i.e., the unique family for which 8B (Aopt B). Indeed: If none of the families is the best, then this criterion is of no use, so there should be at least one optimal family. If several different families are equally best, then we can use this ambiguity to optimize something else: e.g., if we have two families with the same approximating quality, then we choose the one which is easier to compute. As a result, the original criterion was not final: we get a new criterion (A new B if either A gives a better approximation, or if A old B and A is easier to compute), for which the class of optimal families is narrower. We can repeat this procedure until we get a final criterion for which there is only one optimal family. The exact shape depends on the choice of a starting point, on the orientation of the camera, and on the choice of the zoom. It is reasonable to require that if we change the starting point, the orientation, or the zoom, the relative quality of different approximating families should not change. In other words, it is reasonable to require that the relation A B should not change if shift, rotate, or scale the image; i.e., the relation A B should be shift-, rotationand scale-invariant. These requirements can be formalized as follows: Definition 1. Let d > 0 be an integer. By a d-dimensional family, we mean a family A of all functions of the type fC1 F1(x; y) + : : :+ Cd Fd(x; y)g; where Fi(z) are given analytical functions, and C1; : : : ; Cd are arbitrary (real) constants. We say that a set is defined by this family A if its border is described by an equation F (x; y) = 0, with F 2 A. Definition 2. By an optimality criterion, we mean a transitive relation on the set of all d-dimensional families. We say that a criterion is final if there exists one and only one optimal family, i.e., a family Aopt for which 8B (Aopt B). We say that a criterion is shift(corr., rotationand scale-invariant) if for every two families A and B, A B implies TA TB, where TA is a shift (rotation, scaling) of the family A. Proposition. ([Kreinovich et al. 1999], [Wolff et al. 1999] Let d 4, let be a final optimality criterion which is shift-, rotationand scale-invariant, and let Aopt be the corresponding optimal family. Then, the border of every set defined by this family Aopt is either a straight line interval, a circle, or a circular arc. Discussion. The only shape which actually bounds a 2-D set is a circle which bounds a disk. So, as a result of this proposition, we have the following trash shapes: straight line intervals, circular arcs, and disks. When the disk is small, we can view it as a point, which leads us to the fourth possible approximate shape of cotton trash: points. This classification is in perfect agreement with the existing empirical classification of trash into: bark1 (approximately circular arcs), bark2 (straight line segments), leaf (disks), and pepper trash (points). The names of these types of trash come from their physical meaning, with the only exception of pepper trash which refers to broken or crushed pieces of leaf. Practical application. We have used this geometric classification to develop a prototype system for classifying trash. In our system, images (640 480) are acquired using a 3-chip CCD Sony color camera. The imaging hardware consists of a Matrox IM-1280 imaging board and CLD acquisition board. The pixel resolution is 0.13 mm (0.005 inches). The acquired images are flat field corrected for spatial illumination nonuniformity. Each acquired color image (RBG) is converted into hue, luma (intensity), and saturation (HLS) color space (see, e.g., [Russ 1994]), and a threshold on intensity is used to create a black-and-white image. The system uses intelligent pattern recognition techniques (see [Siddaiah et al. 1999], [Siddaiah et al. 1999a]) to detect different shapes. Practical results. The resulting systems achieves a 98% correct classification of trash – a much higher percentage than the previously known methods. Open problem. We described optimal 4-D approximating families. These families give a rather crude description of the actual trash shapes. To get a better understanding of the trash, we may need better approximation, so we may need to use more parameters in the approximating family of sets. Here comes an open problem: we know the optimal 4-dimensional families, but we still need to find out the optimal 5-, 6-, etc.dimensional families for locating trash in ginned cotton. Acknowledgments. This work was supported in part by NASA under cooperative agreement NCC5-209, by NSF grant No. DUE-9750858, by United Space Alliance, grant No. NAS 9-20000 (PWO C0C67713A6), by the Future Aerospace Science and Technology Program (FAST) Center for Structural Integrity of Aerospace Systems, effort sponsored by the Air Force Office of Scientific Research, Air Force Materiel Command, USAF, under grant number F49620-95-1-0518, and by the National Security Agency under Grant No. MDA904-98-1-0561.
منابع مشابه
High Speed Trash Measurements
This paper discusses the identification of trash objects in cotton using machine vision-based systems. Soft computing techniques such as neural networks and fuzzy inference systems can classify trash objects into individual categories such as bark, stick, leaf, and pepper trash types with great accuracies. High speed trash measurements, enables the implementation of these techniques for on-line...
متن کامل2 Preliminary Studies of Cotton Non-lint Content Identification by Near- Infrared Spectroscopy
The high demand for cotton production worldwide has presented a need for its standardized classification. There currently exists trained classers and instrumentation to distinguish key cotton quality parameters, such as some trash types and content. However, it is of interest to develop a universal instrumental method with higher trash identification specificity. A program was implemented to de...
متن کاملAutomation in cotton ginning
This paper discusses a framework for implementation of a machine vision-based system for on-line identification of trash objects commonly found in cotton. Soft computing techniques such as neural networks and fuzzy inference systems can classify trash objects into individual categories such as bark, stick, leaf, and pepper trash types with great accuracies. This identification of trash objects ...
متن کاملBotanical Trash Mixtures Analyzed with Near-Infrared and Attenuated Total Reflectance Fourier Transform Spectroscopy and Thermogravimetric Analysis
Botanical cotton trash mixed with lint reduces cotton’s marketability and appearance. During cotton harvesting, ginning, and processing, trash size reduction occurs, thus complicating its removal and identification. This trash causes problems by increasing ends down in yarn formation and thus processing efficiency. The Uster® High Volume Instrument (HVI) and Shirley Analyzer are extensively use...
متن کاملEvolution of Video Color/trash Processing at Southwestern Cotton Ginning Research Laboratory
The status of video cotton trash and cotton color measurement work at Southwestern Cotton Ginning Research Laboratory (SWCGRL) is presented. New accomplishments are in the areas of video measurement of absolute color in CIE XYZ and Hunter’s Rdab color spaces and improvements of bark trash type identification.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999